GitHub - aaronzweig/third-person

Reference implementation of Offline Permuted Policy Learning from "Provably Efficient Third-Person Imitation from Offline Observation" in UAI 2020.

Credit to Tim Vieira for the arsenal package, and to Tim Vieira and Kianté Brantley for the tabular RL testbed in the files {markovchain, mdp, mrp}.py (also at https://github.com/timvieira/rl)

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
arsenal		arsenal
README.md		README.md
markovchain.py		markovchain.py
mdp.py		mdp.py
mrp.py		mrp.py
third_person_experiments.ipynb		third_person_experiments.ipynb
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

arsenal

arsenal

README.md

README.md

markovchain.py

markovchain.py

mdp.py

mdp.py

mrp.py

mrp.py

third_person_experiments.ipynb

third_person_experiments.ipynb

utils.py

utils.py

Repository files navigation

About

Releases

Packages

Languages

aaronzweig/third-person

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Stars

Watchers

Forks

Languages